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Introduction 

Control  laws  for  modem  fighter  aircraft  are  typically  developed  using  a  'point*  design 
approach.  Figure  1  is  an  illustrated  example  of  the  complexity  associated  with  this  process.  The 
engineer  begins  with  a  six  degree  of  freedom  (DOF)  nonlinear  model  which  completely  describes 
the  aerodynamics  of  the  aircraft  throughout  its  maneuvering  envelope.  The  aircraft  model  normally 
evolves  through  extensive  wind  tunnel  testing  and  computational  fluid  dynamic  analyses  and 
includes  various  uncertainties  and  inaccuracies.  A  baseline  structure  for  the  aircraft  control  system 
must  then  be  developed  which  defines  the  necessary  feedback  parameters  and  compensations. 
From  here,  the  engineer  chooses  a  single  trim  (or  stationary)  point  within  the  aircraft  flight 
envelope.  The  control  taw  design  is  effectively  centered  around  this  point.  The  aerodynamic 
model  is  then  linearized  at  the  trim  point  so  that  conventional  linear  control  theory  may  be  applied 
to  generate  gains  and  parameters  for  the  baseline  con./oi  structure.  Other  techniques  such  as 
Linear  Quadratic  Gaussian  (LOG)  or  Linear  Quadratic  Regulator  (LQR)  may  be  used  to  generate 
controller  gains  which  minimize  a  quadratic  cost  function.  Once  the  controller  gains  have  been 
generated,  the  engineer  chooses  a  different  trim  point  and  again  linearizes  the  aircraft  model  and 
generates  the  optimal  controller  gains  at  the  new  point.  This  method  is  iterated  until  a  complete 
set  of  controller  gains  has  been  generated  for  several  points  within  the  flight  envelope.  Once  the 
process  has  been  completed,  the  engineer  has  essentially  created  a  discrete  nonlinear  functional 
mapping  from  the  aircraft  envelope  into  the  desired  control  system  gains.  This  lengthy  design 
technique,  however,  limits  the  total  number  of  trim  points  the  engineer  is  able  to  investigate  for  a 
given  aircraft. 

Implementation  of  the  various  flight  control  laws  is  done  by  'scheduling*  the  controller 
gains  as  a  function  of  the  of  the  trim  point  parameters  (e.g.  Mach  number,  altitude,  wing  sweep, 
etc.).  In  flight,  these  parameters  are  measured  so  that  the  proper  controller  gains  can  be  found 
through  standard  look-up  tables  residing  in  the  Right  Control  Computer’s  (FCC)  memory. 
However,  the  trim  point  parameters  are  measured  as  continuous  variables  and  their  values 
frequently  fall  between  two  discrete  design  points  each  of  which  corresponds  to  a  particular  set  of 
controller  gains.  At  this  point  the  FCC  must  perform  time  consuming  multi-variable 
interpolations/extrapolations  to  estimate  the  values  for  the  controller  gains. 

In  the  past,  neural  networks  have  been  used  extensively  in  applications  such  as  pattern 
recognition,  functional  synthesis  and  optimization  [1],[2].  In  control  applications,  they  have  been 
used  as  adaptive  controllers  and  state  estimators  [3],  (4],  [5],  [6].  Neural  networks  offer  several 
advantages  which  may  enhance  the  performance  of  gt  ;n  scheduled  controllers  and  simplify  their 
development  process. 

For  example,  back-propagation  networks  can  Implement  nonlinear  mappings  of 
continuous  valued  inputs  through  supervised  learning.  Neural  network  dynamics  also 
display  a  fast  adaptation  rate  to  a  large  number  of  parameters  and  a  uniform  rate  of 
convergence  which  is  independent  of  network  size.  Neural  networks  exhibit  natural 
robustness  due  to  their  generalization  properties  [from  ref.  3}. 

The  objective  of  this  research  is  to  incorporate  emerging  neural  network  technology  into  a 
feasible  concept  which  will  simplify  control  system  development  by  automating  the  design  of 
controller  gains.  The  benefits  anticipated  by  developing  this  concept  are  twofold:  1)  reduce  time 
and  personnel  required  to  develop  control  laws  and  2)  increase  the  number  of  design  points  used 
in  the  flight  envelope  to  improve  controller  performance. 

Concept 

Our  approach  is  to  apply  current  neural  network  concepts  to  synthesize  a  nonlinear 
functional  relationship  between  aircraft  trim  point  parameters  and  the  desired  control  system  gains 
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for  a  fixed-structure  control  law.  The  concept,  as  Mustrated  in  figure  2,  compares  the  aircraft 
responses  with  predetermined  desired  response  models  in  the  neural  control  stage.  The  neural 
control  stage  "grades’  how  well  the  aircraft/controiier  is  responding  with  respect  to  the  desired 
response  models.  The  grade  is  then  used  to  adjust  the  neural  network  so  that  new  controller  gains 
may  be  produced  which  enable  the  aircraft/controller  to  achieve  responses  more  closely  matched 
to  the  desired  responses.  This  cycle  is  performed  repeatedly  at  each  arbitrary  trim  point  until  the 
"grade"  falls  within  allowable  tolerances. 

Figure  3  is  a  detailed  description  of  the  flight  control  law  synthesis  concept  In  block 
diagram  form.  The  following  is  a  brief  description  of  each  major  component: 

1)  The  Aircraft  Model  is  a  complete  set  of  mathematical  equations  (nonlinear  differential) 
which  describe  the  motion  of  the  aircraft  in  one  or  more  of  its  axes.  Aerodynamic  data 
which  reflect  coefficients  of  lift,  drag  and  moments  for  the  aircraft  at  different  flight 
conditions  and  configurations  accompany  the  equations  of  motion. 

2)  The  Fixed-Structure  Feedback  Controller  contains  the  control  laws  which,  in 
combination  with  the  correct  gains,  provide  the  aircraft  with  the  proper  handling  qualities 
and  stability  margins.  Control  laws  are  the  mathematical  equations  which  relate  the  pilot 
input  commands  and  sensed  aircraft  parameters  to  control  surface  deflections.  The 
fixed  structure  implies  that  input  commands  and  sensed  parameters  are  always  present 
within  the  control  laws  but,  their  "weights*  or  gains  (i.e.  the  degree  to  which  they  affect 
the  control  law)  may  vary  as  a  function  of  the  aircraft's  flight  condition. 

3)  The  Performance  Models  are  a  set  of  second  order  linear  transfer  functions  which  are 
intended  to  exemplify  the  desired  maneuvering  characteristics  of  the  aircraft/controiier. 
The  performance  models  are  designed  in  accordance  with  the  frequency  and  damping 
requirements  listed  in  MIL-F-8785C  "Military  Specification  for  Flying  Qualities  of  Piloted 
Airplanes."  The  frequency  and  damping  requirements  are  based  on  classification  (e  g. 
fighter,  transport,  etc.),  category  and  desired  flying  quality  level  for  the  specific  aircraft. 

4)  The  Test  Input  is  the  pilot  commanded  input  to  both  the  aircraft  and  the  performance 
model.  Various  test  inputs  are  used  to  excite  different  modes  in  the  aircraft  model  (e  g. 
pitch  doublet  is  used  to  excite  short  period  mode). 


5)  The  Cost  Functional  or  performance  index  is  the  mechanism  used  to  determine  the 
"mismatch"  between  the  output  of  the  performance  models  and  the  actual  response  of 
the  aircraft  at  a  given  trim  point.  Although  there  are  many  mathematical  relationships 
which  may  be  used  for  the  cost  functional  (7J,  a  common  one  in  control  system  design  is 
the  root  mean  square  error  (Erms)  given  by  the  equation: 


Where  e^t)  =  (x<j(t)  -  Xa(t)]2  is  the  square  error  between  the  aircraft  response  xa(t)  and 
the  desired  response  x<j(t)  provided  by  the  performance  models  over  the  period,  T.  If  the 
performance  index  is  averaged  over  all  selected  design  points  P  in  the  envelope,  the 
average  system  error  (EaS)  is  obtained: 


Eas  -  2(Enns)/P  (2) 

The  Eas  has  been  previously  referred  to  as  the  "grade"  of  the  aircraft  controller. 
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6)  The  Neural  Network  chosen  for  the  control  law  synthesis  concept  Is  the  multilayer 
feedforward  perceptron  network.  This  type  of  network  is  Illustrated  schematically  in 
figure  4  and  is  typically  composed  of  sets  of  nodes  (processors)  arranged  in  layers.  The 
output  of  each  node  in  the  previous  layer  is  then  connected  to  the  input  of  each  node  in 
the  subsequent  layer  through  an  adjustable  weight  known  as  the  synaptic  weight. 
Layers  between  the  output  layer  and  the  input  layer  are  called  hidden  layers.  The 
adjustable  weights  are  intended  to  amplify,  attenuate  or  inhibit  transmission  of  nodal 
outputs  from  one  layer  to  the  next.  The  net  input  to  each  node  in  a  layer  is  the  sum  of 
the  weighted  outputs  of  every  node  in  the  previous  layer  (equation  3)  with  the  exception 
of  the  input  layer  [1  ].  A  nodal  output  is  then  determined  by  the  activation  function  of  the 
particular  node  which  maps  its  net  input  plus  a  node  'bias'  into  the  output  of  the  node. 
Although  it  is  not  a  requirement,  all  nodes  within  a  network  typically  possess  the  same 
activation  function.  For  the  case  of  our  multilayer  network,  an  activation  function  known 
as  the  sigmoid  function,  given  by  equation  (4)  was  chosen. 

netinj  =  EWyOi  (3) 

Oj  =  1/(1  +  e‘(netini  +  ei)/®0]  (4) 

Where  Wy  is  the  synaptic  weight  connecting  the  output  of  i^1  node  in  layer  N-1  to  the 
input  of  node  j  in  layer  N.  6j  is  the  bias  input  and  8o  is  the  temperature  coefficient  used 
to  modify  the  shape  of  the  sigmoid  at  node  j. 

The  neural  network's  main  function  in  the  control  law  synthesis  concept  is  to  develop  for 
the  aircraft  controller  a  set  of  optimal  gains  for  each  design  point  in  the  envelope.  Trim 
point  parameters  are  furnished  as  inputs  and  the  network  transmits  gains  to  the  aircraft 
controller  via  its  output  layer. 

7)  The  Training  Algorithm  is  the  vehicle  by  which  synaptic  weights  and  node  biases  of  a 
particular  neural  network  can  be  determined  so  that  the  network  provides  a  specific 
functional  mapping.  The  training  algorithm  can  be  thought  of  as  a  neural  network 
'designer.'  Back-Error-Propagation  (BEP)  was  one  of  the  first  effective  training 
algorithms  for  multilayer  perceptron  networks  and  is  still  used  in  many  applications 
because  of  its  straight-forward  implementation.  BEP  in  the  form  used  in  this  study  was 
introduced  by  Rumelhart,  Hinton  and  Williams  [8J,[9]  in  1986.  BEP  learning  is  an 
iterative  gradient  algorithm  that  modifies  the  synaptic  weights  throughout  a  neural 
network  to  minimize  the  error  between  the  actual  output  of  the  network  and  its  desired 
output  which  is  determined  by  the  user  [10], [11].  For  the  concept  presented  in  this 
study,  the  neural  network  outputs  are  the  aircraft  controller  gains  which  are  not  known 
apriori  to  training.  Therefore,  the  algorithm  must  be  modified  slightly  to  remain  useful  in 
this  application. 

The  desired  synaptic  weights  for  the  network  are  achieved  by  making  incremental 
changes  to  the  weights  according  to  the  following  equation: 

Wij(n+1)  =  Wy(n)  +  r?5jXi  (5) 

Where  Wy(n)  is  the  old  synaptic  weight,  Wy(n+ 1)  is  the  new  weight  and  r?  is  the  training 
constant.  Oi  is  the  output  of  node  i  in  a  hidden  layer  (or  the  Ith  input  lj,  when  node  j  lies  in 
the  first  hidden  layer).  6  j  is  the  error  or  gradient  term  for  node  j.  If  node  j  is  an  output 
node,  then 


6 j  =  Oj(l  -  Oj)  (-6Eas/«SOj)  (6) 
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In  the  standard  implementation  of  the  BEP  algorithm,  the  term  -S  Eas /S  Oj  in  equation  6 
would  be  replaced  by  the  error  between  the  desired  output  and  the  actual  network 
output  (dj  •  Oj).  As  mentioned  before,  the  desired  output  is  unknown  in  our  application 
but,  the  gradient  of  the  average  system  error  (eq.  2)  with  respect  to  the  network  outputs 
(6  E u/6  Oj)  can  be  determined  numerically. 

If  node  ]  lies  within  the  hidden  layers  of  the  network,  then 

Sj  =  Xj(l  -  Xj)S6kWjk  ,  (7) 

k 

where  k  is  over  all  nodes  in  the  layer  directly  following  node  j. 

Using  the  recursive  equation  (5)  with  equations  (6)  and  (7),  the  algorithm  starts  at  the 
output  nodes  O,  and  works  its  way  back  through  the  successive  hidden  layers.  Node 
biases  are  adapted  in  a  nearly  identical  fashion  by  assuming  they  are  connection  weights 
from  nodes  with  constant  valued  outputs. 

Finally,  a  momentum  term  is  added  to  equation  (5)  to  accelerate  convergence  of  the 
algorithm  to  the  desired  solutions  [1],[10],[11].  The  term  suggests  that  the  synaptic 
weight  change  which  occurs  from  training  cycle  n  to  n+1  is  somewhat  related  to  the 
previous  weight  change  from  training  cycle  n-1  to  n.  The  proportionality  constant  a 
determines  the  amount  of  the  previous  weight  change  that  will  be  used.  Incorporating 
the  momentum  term,  equation  (5)  becomes 

Wjj(n+1)  =  Wjj(n)  +  r)5jXi 

+  a[Wii(n)  -  Wjj(n-l)  ]  ,  (8) 

where  0  <  a  <  t.  Equation  (8)  is  the  expanded  equation  implemented  in  our  training 
algorithm. 

The  flight  control  law  synthesis  concept  as  shown  in  figure  3  is  very  similar  in  structure  to 
the  popular  adaptive  control  technique  known  as  the  Model  Referencing  Adaptive  Controller 
(MRAC).  Both  the  MRAC  and  the  neural  network  concept  described  in  this  report  use  a 
performance  model  (or  command  model)  and  an  adaptation  scheme  to  achieve  controller  gain 
adjustments.  There  are  however,  some  major  distinction  between  the  two  approaches: 

1)  The  MRAC  operates  on-line  (i.e.  while  the  aircraft  is  In  flight)  and  therefore,  must  provide 
stability  as  well  as  proper  flying  qualities  from  the  onset.  On  the  other  hand,  the  neural 
network  concept  described  herein  operates  off-line  (as  a  control  designer)  and  can  learn 
by  experience  which  gains  stabilize  the  aircraft  and  provide  adequate  handling 
characteristics. 

2)  The  neural  network  incorporates  memory  Into  the  concept  in  the  form  of  synaptic  weights. 
This  is  an  important  difference  because  the  MRAC  must  constantly  adjust  the  controller 
gains  as  the  aircraft  progresses  through  the  envelope.  The  neural  network  concept 
however,  remembers  the  gains  at  a  particular  trim  point  and  makes  adjustments  as 
needed.  Once  the  network  has  arrived  at  the  proper  gains,  the  training  algorithm 
essentially  halts  until  an  event  occurs  (e.g.  change  in  aircraft  dynamics)  which  produces 
an  output  error  between  the  aircraft/controller  and  the  performance  model. 
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Operation  of  the  flight  control  law  synthesis  concept  is  summarized  in  the  following  steps: 


Step  1.  Initialize  Network  Weights  and  Biases.  Randomly  set  all  synaptic  weights  and 
node  biases  to  small  values. 

Step  2.  Trim  aircraft.  Set  aircraft  at  trim  point  to  begin  design. 

Step  3.  Calculate  the  Controller  Gains.  Forward  propagate  the  set  of  trim  parameters 
(input)  via  equations  3  and  4  through  the  neural  network  and  transfer  the  outputs 
to  the  aircraft  controller. 

Step  4.  Apply  Test  Input.  Initiate  the  test  maneuver  (e.g.  pitch  doublet)  for  the 
aircraft/controller  and  the  performance  model 

Step  5.  Evaluate  Performance  of  Controller.  Calculate  the  root  mean  square  error 
(Erms  -  eq.  1)  between  the  aircraft  and  performance  model. 

Step  6.  Repeat  Steps  2  Through  5.  Repeat  these  steps  for  each  trim  point  in  the 
design  envelope. 

Step  7.  Grade  the  Neural  Network.  With  Erms  calculations  from  step  5  determine  the 
average  system  error  (Eas  -  eq.  2)  and  gradients  for  training. 

Step  8.  Train  the  Neural  Network.  Adjust  the  synaptic  weights  and  node  biases  of  the 
network  (eq.  5  -  8)  using  BEP  to  reduce  the  Eas. 

Step  9.  Repeat  Procedure.  Repeat  steps  2  through  7  until  the  Eas  has  fallen  below  a 
prescribed  tolerance. 

RESULTS 

The  flight  control  law  synthesis  concept  and  neural  network  algorithms  aforementioned 
were  coded  in  software  and  numerous  simulations  were  performed  to  examine  the  effectiveness  of 
this  approach.  As  an  initial  effort,  very  simplified  models  and  control  structures  replaced  the 
aircraft  model  and  flight  control  system.  In  addition,  the  performance  models  were  also  replaced 
by  arbitrary  models  not  intended  in  any  way  to  describe  actual  aircraft  handling  characteristics  as 
in  Mil-F-8785C.  This  section  contains  some  of  the  results  of  these  computer  simulations  which 
were  designed  to  support  the  basic  ideas  of  the  neural  network/flight  control  law  synthesis 
concept. 

The  first  simulation  used  a  second  order  Single  Input  Single  Output  (SISO)  linear  system 
in  place  of  the  aircraft  model.  The  transfer  function  of  the  model  is  given  by  equation  (9). 

Hi(s)  =  1/  ( s2  +  2s  +  1)  (9) 

This  transfer  function  represents  a  critically  damped  system  with  a  damping  ratio  of  1 .0  and  a 
natural  frequency  of  1.0  rad /sec.  The  control  structure  for  this  example  is  illustrated  in  figure  5. 
The  controller  provides  proportional  output  and  rate  feedback  via  gains  Kf  and  «2  and  a 
proportional  input  gain  K3.  An  arbitrary  SISO  linear  system  has  been  chosen  as  the  performance 
model  for  this  example  with  a  transfer  function  given  by: 

H2(s)  =  4/ (s2  +  2.82s  +  4)  (10) 
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The  simulation  was  configured  according  to  the  following  specifications: 


Test  input 

unit  step 

Plant  Model 

Hi(s) 

(aq.9) 

Controller 

figure  5; 

3  design  points* 

Performance  Model  : 

Ha(s) 

<aq.  10) 

Cost  Functional  ; 

Erma 

Eas 

MM 

Training  Algorithm 

back-error  propagation 

Neural  Network  : 

2  Inputs  (plant  initial  conditions) 

3  hidden  layers 
layer  1-10  nodes 
layer  2-12  nodes 
layer  3  - 15  nodes 

3  outputs  (gains) 

*Note  that  a  set  of  three  different  initial  condition  vectors  [xi  (0)  x2(0)]^  were 
used  as  inputs  to  the  neural  network  and  therefore,  formed  three  design  points 
for  the  controller. 


Two  separate  trials  were  run  for  the  first  simulation  and  the  initial  controller  gains  were 
changed  between  trials.  For  each  of  the  trials,  the  initial  conditions  and  gains  were  set  to: 

Trial  #1:  (xt  =  0.0,  X2  =  0.0)  ->  (Ki  =  0.0,  K2  =  0.0,  K3  =  1.0) 

(x,  =  0.1,  X2  =  0.3)  ->  (Ki  =  0.0,  K2  =  0.0,  K3  =  1.0) 

(xi  =  0.5,  x2  =  0.75)  ->  (Kt  =  0.0,  Kz  =  0.0,  K3  =  1.0) 

initial  cost:  0.1881 

Trial  #2:  (xt  =  0.0,  x2  =  0.0)  ->  (Ki  =  2.0,  K2  =  5.0,  K3  =  5.0) 

(xt  =  0.1,  x2  =  0.3)  ->  (Ki  =  2.0, Kz  =  5.0,  K3  =  5.0) 

(xi  =  0.5,  x2  =  0.75)  ->  (Ki  =  2.0, 1^  =  5.0,  K3  =  5.0) 

initial  cost:  0.152 

The  gains  Ki,  Kz  and  K3  were  adjusted  dynamically  by  the  neural  network  throughout  the 
simulation  so  that  the  plant/controller  response  began  to  match  more  closely  the  response  of  the 
performance  model  in  each  successive  training  cycle.  The  results  of  both  simulation  trials  are 
listed  below: 

Trial  #1:  (xi  =  0.0,  x2  =  0.0)  ->  (Ki  =  -  0.226, Kz  =  1.495,  Ka  =  2.443) 

(Xi  =  0.1,  Xa  =  0.3)  ->  (Ki  =  -  0.226,  Ka  =  1.492,  K3  =  2.438) 

(Xi  =  0.5,  X2  =  0.75)  ->  (Ki  =  -  0.225,  K2  =  1.488,  K3  =  2.431) 

training  cycles:  200 
final  cost:  0.03876 
performance  improvement:  79.39% 
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Trial  #2:  (xi  =  0.0,  X2  =  0.0)  ->  (Ki  =  1.386,  K2  =  3.611,  K3  =  4.645) 

(xi  =  0.1.X2  =  0.3)  ->  (Ki  =  1.385,  K2  =  3.610,  K3  =  4.646) 

(xi  =  0.5,  X2  =  0.75)  ->  (Ki  =  1.385,  K2  =  3.610,  K3  =  4.648) 

training  cycles:  55 
final  cost:  0.009869 
performance  improvement:  93.51% 

Figures  6  and  7  show  the  initial  and  final  responses  for  each  trial  run  at  the  first  initial 
condition.  The  desired  responses  are  identified  by  a  dashed  line  and  the  actual  plant /controller 
responses  by  the  solid  line.  It  should  be  noted  that  trial  #2  arrived  at  a  better  solution  quicker  th.  : 
trial  #1.  This  result  is  primarily  due  to  the  choice  of  Initial  gains  for  each  trial  run.  The  first  set  of 
initial  gains  caused  the  network  and  training  routine  to  progress  to  a  local  minimum  at  Ki  =  -0.226, 
K2  =  1.492,  K3  =  2.438).  Whereas,  the  second  set  of  initial  gains  caused  the  network  to  progress 
to  a  lower  minimum  at  Ki  =  1.385,  K2  =  3.610,  K3  =  4.646).  It  should  also  be  noticed  that  the 
network  arrived  at  essentially  the  same  gains  for  each  initial  condition.  This  is  a  correct  result 
because  it  can  be  shown  that  a  linear  plant  (as  in  the  first  simulation)  requires  only  a  linear  control 
system  (i.e.  one  set  of  gains  for  all  conditions)  to  produce  the  required  response. 

The  final  set  of  simulations  also  used  a  second  order  SISO  system  in  place  of  the  aircraft 
model.  However,  a  nonlinearity  was  incorporated  into  the  model  to  provide  the  neural 
network/flight  control  law  'nthesis  concept  with  a  more  realistic  problem.  Nonlinearities  in  plant 
dynamics  can  have  many  forms.  A  rate  limiter  was  included  in  this  particular  example  to  represent 
a  typical  nonlinear  system.  The  linearized  transfer  function  of  the  model  is  given  by  equation  (11). 

Hi(s)  =  4/ (s2  +  1.2s  +  4)  (11) 

This  transfer  function  represents  a  underdamped  system  with  damping  ratio  of  0.3  and  a  natural 
frequency  of  2.0  rads/sec.  The  control  structure  and  performance  model  for  this  example  are 
identical  to  those  presented  in  the  previous  example. 

This  simulation  was  configured  according  to  the  following  specifications: 


Test  Input 

unit  step 

Plant  Model 

nonlinear; 
rate  limited 

(eq.  11) 

Controller 

figure  5; 

3  design  points 

Performance  Model  : 

H2(s) 

(«?•  10) 

Cost  Functional 

Erms 

(aq.  1) 

Eas 

(•q  2) 

Training  Algorithm 

back-error  propagation 

Neural  Network 

2  Inputs  (plant  initial  conditions) 

2  hidden  layers 
layer  1  -  5  nodes 
layer  2-10  nodes 

3  outputs  (gains) 
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As  in  the  first  simulation  example,  two  separate  trial  run  were  made  and  the  initial 
controller  gains  were  changed  between  trials.  For  the  each  of  the  trials,  the  initial  conditions  and 
gains  were  set  to: 

Trial  #1:  (xi  =  0.0,  x2  =  0.0)  ->  (Ki  =  0.0,  K*  =  0.05,  K3  =  1.0) 

(X!  =  0.1,  X2  =  0.3)  ->  (Ki  =  0.0,  K2  =  0.05,  K3  =  1.0) 

(xt  =  0.5,  X2  =  0.75)  ->  (Ki  =  0.0,  K2  =  0.05,  K3  =  1.0) 

Initial  cost:  0.09864 

Trial  #2:  (xi  =  0.0,  X2  =  0.0)  ->  (Ki  =  0.9,  K2  =  2.05,  K3  =  2.75) 

(xt  =  0.1,  X2  =  0.3)  ->  (K}  =  0.9,  K2  =  2.05,  K3  =  2.75) 

(xi  =  0.5,  X2  =  0.75)  ->  (Ki  =  0.9,  K2  =  2.05,  K3  =  2.75) 


initial  cost:  0.1080 

Again,  the  gains  Ki,  K2  and  K3  were  adjusted  dynamically  by  the  neural  network  throughout  the 
simulation  so  that  the  plant/controiler  response  began  to  match  more  closely  the  response  of  the 
performance  model  in  each  successive  training  cycle.  The  results  of  both  simulation  trials  are 
lie*' oeiow: 

Trial  #1:  (xi  =  0.0,  X2  =  0.0)  ->  (Ki  =  0.754,  K2  =  1.279,  K3  =  2.313) 

(xi  =  0.1,  x2  =  0.3)  ->  (Ki  =  0.741, 1^  =  1.271,  K3  =  2.293) 

(xi  =  0.5,  x2  =  0.75)  ->  (Ki  =  0.724,  K2  =  1.262,  K3  =  2.270) 

training  cycles:  100 
final  cost:  0.04415 
performance  improvement:  55.24% 

Trial  #2:  =  0.0,  x2  =  0.0)  ->  (Ki  =  1.033,  K2  =  2.593,  K3  =  3.593) 

(xi  =  0.1,  x2  =  0.3)  ->  (Ki  =  1.039,  K2  =  2.616,  K3  =  3  627) 

(xi  =  0.5,  x2  =  0.75)  ->  (Ki  =  1.052,  K2  =  2.654,  K3  =  3.686) 

training  cycles:  50 
final  cost:  0.0443 

performance  improvement:  58.98% 

Figures  8  and  9  show  the  initial  and  final  responses  for  each  trial  run  of  the  nonlinear 
simulation  example.  The  desired  responses  are  again  identified  by  a  dashed  line  and  the  actual 
plant/controller  responses  by  the  solid  line.  In  this  example,  the  concept  concluded  with  roughly 
the  same  amount  of  success  at  both  sets  of  initial  gains.  There  is  an  initial  mismatch  between  the 
desired  and  actual  responses  in  both  cases  due  to  the  rate  limiting  imposed  by  the  plant  dynamics. 
It  is  interesting  to  note  that  the  neural  network  did  not  attempt  to  drive  up  the  feedback  gains  of  the 
controller  to  cvercompensate  for  this  nonlinear  rate  limit.  The  network,  however,  realized  the 
physical  limitations  of  the  problem  and  attempted  to  provide  the  best  possible  solution. 

CONCLUSIONS 


Based  on  an  analysis  of  the  results  obtained  from  the  computer  simulations  and 
discussed  in  the  previous  section,  some  concluding  statements  can  be  made  about  our  neural 
network/flight  control  law  synthesis  concept: 

1)  The  neural  network,  working  within  our  concept,  was  able  to  produce  feasible  controller 
gains  which  improved  the  plant /controller  response  with  respect  to  the  performance  model. 
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2)  The  neural  network  was  able  to  accomplish  (1)  without  any  prior  knowledge  of  the  plant, 
controller  or  performance  model  dynamics. 

3)  The  neural  network  did  not  supply  controller  gains  which  would  cause  the  plant/controller 
to  become  unstable  or  diverge  from  the  performance  model. 

4)  The  neural  network  provided  better  results  faster  when  the  initial  gains  were  dose  to 
optimum. 

5)  The  neural  network  will  not  always  converge  to  an  optimal  solution.  The  network  gives  no 
indication  that  it  has  arrived  at  either  an  optimal  or  sub-optimal  solution. 

6)  The  neural  network  did  not  attempt  to  overcompensate  for  limitations  imposed  by 
nonlinearities  within  the  plant  (e.g.  continually  increase  a  feedback  gains  when  the 
ptant/controller  is  rate  saturated). 

7)  The  neural  network  did  not  require  specific  knowledge  regarding  the  source  or  location  of 
the  error  (i.e.  steady  state  error,  peak  overshoot  error,  etc.)  but  was  able  to  increase  the 
plant/controller  performance  based  on  Eims  information  alone. 

In  light  of  the  preceding  condusions,  a  neural  network  operating  in  an  environment  such 
as  the  one  describe  herein  may  offer  several  advantages  over  conventional  controller  design 
methods.  Such  advantages  include:  1)  a  simplified  design  procedure  due  to  the  networks  ability  to 
produce  optimal  gains  without  specific  knowledge  of  aircraft  dynamics,  2)  a  more  accurate 
controller  design  due  to  the  networks  ability  to  handle  a  large  number  of  parameters  and  many 
design  points  and  3)  a  more  robust  controller  due  to  the  generalization  properties  of  the  network. 
However,  it  should  be  noted  that  the  neural  network  field  is  far  from  mature  and  many  sources  of 
uncertainty  still  exist  which  inhibit  a  complete  understanding  of  their  operation  within  application 
such  as  the  control  law  synthesis  concept.  And  although  reasonable  results  were  obtained  for  the 
selected  linear  and  nonlinear  examples  herein,  a  great  effort  and  challenge  still  remains  to  apply 
such  a  concept  to  the  highly  complex  dynamics  of  an  aircraft  model. 

FUTURE  WORK 

In  the  future,  work  of  this  nature  shall  progress  in  the  following  direction: 

1)  Software  integration  of  a  nonlinear  three  degree-of-freedom  aircraft  simulation  model.  The 
model  shall  include  force  and  moment  equations  for  the  longitudinal  axis  of  an  F/A-18 
aircraft  with  aerodynamic  data  for  the  full  maneuvering  envelope. 

2)  Develop  a  set  of  performance  models  via  Mil-F-8785C  for  the  longitudinal  axis  of  a  Class  IV, 
Category  A  high  maneuverability  fighter/attack  aircraft  with  Level  I  handling  qualities. 

3)  Conduct  experiments  with  3-DOF  aircraft  model  and  the  neural  network/flight  control  law 
synthesis  concept  via  repeated  computer  simulation  and  analysis. 

4)  Evaluate  performance  of  aircraft  and  neural  network  generated  controller. 

5)  Investigate  potentials  for  improving  concept  and  possibilities  of  implementing  variations  of 
this  concept  in  an  on-line  adaptive/leaming  control  scheme. 


9 


Linear 


NADC-91004-60 


FIGURE  1:  CONVENTIONAL  POINT  DESIGN  1  FIGURE  2:  NEURAL  NETWORK  APPROACH 
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FIGURE  3:  FLIGHT  CONTROL  LAW  SYNTHESIS  CONCEPT 
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FIGURE  8:  INITIAL  AND  FINAL  RESPONSES  FOR  FIGURE  9:  INITIAL  AND  FINAL  RESPONSES  FOR 

NONLINEAR  EXAMPLE  -  TRIAL  #1  NONLINEAR  EXAMPLE  -  TRIAL  #2 
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